We're Not in Kansas Anymore: Detecting Domain Changes in Streams
نویسندگان
چکیده
Domain adaptation, the problem of adapting a natural language processing system trained in one domain to perform well in a different domain, has received significant attention. This paper addresses an important problem for deployed systems that has received little attention – detecting when such adaptation is needed by a system operating in the wild, i.e., performing classification over a stream of unlabeled examples. Our method uses Adistance, a metric for detecting shifts in data streams, combined with classification margins to detect domain shifts. We empirically show effective domain shift detection on a variety of data sets and shift conditions.
منابع مشابه
"We're Not in Kansas Anymore"
Wi en Dorothy awoke in the Land of Oz, she iscovered a Technicolor land full of wonrous things and people. She thought she was "over the rainbow" and "someplace where there isn't any trouble." But she soon learned that the Land of Oz could be a dangerous place and that to navigate it successfully would require a commitment to purpose plus considerable ingenuity. After reflecting on graduate edu...
متن کامل"I don't think we're in Kansas anymore".
Persona focused user experiences Archetypes not stereotypes
متن کاملToto, We're Not in Kansas Anymore: On Transitioning from Research to the Real (Invited Industrial Talk)
Over the past five years or so, I’ve moved from academia to industrial research and now to an Internet startup. In this talk, I’ll share some of the experiences I’ve had and some of the things that I’ve observed and learned along the way. Among the issues to be discussed are some of the technical and nontechnical “real-world constraints” that I’ve bumped into in trying to apply research-y think...
متن کاملCompressed Domain Scene Change Detection Based on Transform Units Distribution in High Efficiency Video Coding Standard
Scene change detection plays an important role in a number of video applications, including video indexing, searching, browsing, semantic features extraction, and, in general, pre-processing and post-processing operations. Several scene change detection methods have been proposed in different coding standards. Most of them use fixed thresholds for the similarity metrics to determine if there wa...
متن کاملDetecting Changes in Unlabeled Data Streams Using Martingale
The martingale framework for detecting changes in data stream, currently only applicable to labeled data, is extended here to unlabeled data using clustering concept. The one-pass incremental changedetection algorithm (i) does not require a sliding window on the data stream, (ii) does not require monitoring the performance of the clustering algorithm as data points are streaming, and (iii) work...
متن کامل